3574 results found.
Written
Representation-Annotation Standards/Best Practices,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons
Size:
501 KByte Production Status:
Existing-used
Use:
Discourse
-
Paper title:Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | James Ravenscroft | University of Warwick | GB |
| Author 2 | Anika Oellrich | King's College London | GB |
| Author 3 | Shyamasree Saha | Queen's College London | GB |
| Author 4 | Maria Liakata | University of Warwick | GB |
| Main Contact | Maria Liakata | University of Warwick | None |
Documentation:
<Not Specified>
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Apache
Size:
2133 sounds OtherProduction Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database
-
Paper track:Multimodality
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Emiel van Miltenburg | Vrije Universiteit Amsterdam | NL |
| Author 2 | Benjamin Timmermans | Vrije Universiteit Amsterdam | NL |
| Author 3 | Lora Aroyo | Vrije Universiteit Amsterdam | NL |
| Main Contact | Emiel van Miltenburg | Vrije Universiteit Amsterdam | None |
Documentation:
English README available on the GitHub page, otherwise see our paper.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons (CC BY-SA)
Size:
120 KByte Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Beatrice Alex | University of Edinburgh, School of Informatics | GB |
| Author 2 | Clare Llewellyn | University of Edinburgh | GB |
| Author 3 | Claire Grover | University of Edinburgh | GB |
| Author 4 | Jon Oberlander | University of Edinburgh | GB |
| Author 5 | Richard Tobin | University of Edinburgh, School of Informatics | GB |
| Main Contact | Beatrice Alex | University of Edinburgh, School of Informatics | None |
Documentation:
In progress but some description is already provided at the URL provided.
Written
Lexicon,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
12000 lexemes Production Status:
Existing-used
Use:
FrameNet-based Shallow Semantic Parsing
-
Paper title:Developing a French FrameNet: Methodology and First results
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Marie Candito | Univ Paris Diderot - INRIA - Alpage | FR | ||
| Author 10 | Philippe Muller | IRIT, Toulouse University | FR | ||
| Author 11 | Benoît Sagot | Inria | FR | ||
| Author 12 | Laure Vieu | IRIT-CNRS-Toulouse University | FR | ||
| Author 2 | Pascal Amsili | LLF (Univ Paris Diderot / CNRS) | FR | ||
| Author 3 | Lucie Barque | LDI, Univ Paris 13 | FR | ||
| Author 4 | Farah Benamara | IRIT, Toulouse University | FR | ||
| Author 5 | Gaël de Chalendar | CEA LIST | FR | ||
| Author 6 | Marianne Djemaa | Alpage (Univ Paris Diderot / INRIA) | FR | ||
| Author 7 | Pauline Haas | LDI, Univ Paris 13 | FR | ||
| Author 8 | Richard Huyghe | CLILLAC-ARP (Univ Paris Diderot) | FR | ||
| Author 9 | Yvette Yannick Mathieu | LLF (Univ Paris Diderot / CNRS) | FR | ||
| Main Contact | Marie Candito | Univ Paris Diderot - INRIA - Alpage | None | LLF (Univ Paris Diderot / CNRS) | None |
Documentation:
Yes, in English
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
1.8 million articles OtherProduction Status:
Existing-used
Use:
Discourse
-
Paper title:A corpus of general and specific sentences from news
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Annie Louis | University of Pennsylvania | None |
| Author 2 | Ani Nenkova | University of Pennsylvania | None |
| Main Contact | Annie Louis | University of Pennsylvania | US |
Documentation:
yes, english, http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2008T19
Written
Corpus,
Language Type:
Multilingual
Languages:
English Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
1965734 sentences Production Status:
Existing-used
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.
-
Paper track:Under-resourced Languages
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jeremy Barnes | Universitat Pomeu Fabra | ES | ||
| Author 2 | Patrik Lambert | UPF | FR | WebInterpret | N/A |
| Author 3 | Toni Badia | Universitat Pompeu Fabra | AD | ||
| Main Contact | Jeremy Barnes | Universitat Pomeu Fabra | None |
Documentation:
<Not Specified>
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
MIT
Size:
2.5 GByte Production Status:
Newly created-in progress
Use:
Machine Learning
-
Paper title:A Corpus of Images and Text in Online News
-
Paper track:Multimodality
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Laura Hollink | CWI | NL |
| Author 2 | Adriatik Bedjeti | Centrum Wiskunde & Informatica | NL |
| Author 3 | Martin van Harmelen | Centrum Wiskunde & Informatica | NL |
| Author 4 | Desmond Elliott | University of Amsterdam | NL |
| Main Contact | Laura Hollink | CWI | None |
Documentation:
Yes, publicly available, in EnglishLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Apache License 2.0
Size:
3.44 MByte Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Identifying Content Types of Messages Related to Open Source Software Projects
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Yannis Korkontzelos | Department of Computing, Edge Hill University | GB | National Centre for Text Mining, The University of Manchester | GB | ||
| Author 2 | Paul Thompson | <Not Specified> | None | National Centre for Text Mining, The University of Manchester | GB | University of Manchester - NaCTeM | None |
| Author 3 | Sophia Ananiadou | University of Manchester | GB | ||||
| Main Contact | Yannis Korkontzelos | Department of Computing, Edge Hill University | None |
Documentation:
Documentation in english is included in the xml file.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
LGPL
Size:
64 Production Status:
Newly created-in progress
Use:
Documentation and regression testing
-
Paper title:Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dan Flickinger | Stanford | US |
| Author 2 | Emily M. Bender | University of Washington | US |
| Author 3 | Stephan Oepen | Universitetet i Oslo | NO |
| Main Contact | Stephan Oepen | Universitetet i Oslo | None |
Documentation:
an emerging collection of wiki pagesLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
None because the dataset is not published now
Size:
199 KByte Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Composing Distributed Representations of Relational Patterns
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Poster - Tuesday
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sho Takase | Tohoku University | JP |
| Author 2 | Naoaki Okazaki | Tohoku University | JP |
| Author 3 | Kentaro Inui | Tohoku University | JP |
| Main Contact | Sho Takase | Tohoku University | None |
Documentation:
We prepared an english documentation as below




